This listing of claims will replace all prior versions, and listings, of claims in the application. 
Listing of Claims: 

Claims 1-17. Canceled. 

18. (New) A method of controlling a system to optimize an objective function 
thereof, the system being capable of performing a plurality of candidate actions and being 
capable of monitoring response performances of a performance of a respective candidate 
action, the method comprising the steps of:- 

a) monitoring response performance of a respective candidate action that is chosen to 
be performed; 

b) storing, according to candidate action performed, a representation of said 
monitored response performance; 

c) choosing which of the plurality of candidate actions is next performed so as to 
optimize said objective function by assessing, using the probability distribution of the 
response performance of all of said plurality of candidate actions, which candidate action is 
likely to result in the lowest expected growth in regret after the chosen candidate action is 
performed; and 

d) repeating steps a) to c); 

where regret is a term used for the shortfall in response performance between always 
performing the true best candidate action and actually performing the candidate actions 
chosen to be performed. 
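
For illustration only, the definition of regret given above can be sketched numerically. This is a minimal sketch that assumes the true mean response performance of each candidate action is known, which in practice it is not; the names `true_means` and `chosen_actions` are hypothetical and do not appear in the claims.

```python
def cumulative_regret(true_means, chosen_actions):
    """Shortfall between always performing the true best candidate action
    and performing the candidate actions that were actually chosen.

    true_means     -- assumed true mean response performance per action
    chosen_actions -- indices of the actions performed, in order
    """
    best = max(true_means)  # performance of the true best action
    return sum(best - true_means[a] for a in chosen_actions)


# Three candidate actions; the third is the true best.
example_regret = cumulative_regret([0.25, 0.5, 0.75], [0, 2, 1, 2])
```

Each performance of a non-best action adds its shortfall to the total, so a chooser that lowers the expected growth in this quantity is spending fewer performances on inferior actions.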

19. (New) A method according to claim 18 wherein step c) includes assessing 
which candidate action is likely to result in the lowest expected growth in regret on the basis 
of a true best candidate action which has the mean of said probability distribution. 

20. (New) A method according to claim 18 wherein step c) includes evaluating the 
cost or losses associated with presenting a lower performing candidate action and the gain or 
benefit associated with knowing the true position of the current best observed candidate 
action on said probability distribution. 

21. (New) A method according to claim 20 wherein step c) includes assessing 
which candidate action is likely to result in the lowest expected growth in regret according to 
an assumption that the current best observed candidate action is assumed to have zero 
uncertainty around its mean or expected response performance. 

22. (New) A method according to claim 18 wherein step c) includes assessing 
which candidate action is likely to result in the lowest expected growth in regret according to 
an assumption of a Student's distribution and evaluation of Student's t parameters as the basis 
for estimating probabilities of unequal or equal response states between the candidate action 
with the current expected best response performance and any other candidate action. 

23. (New) A method according to claim 18 wherein step c) includes using a 
Monte Carlo algorithm to provide understanding of the probability distribution of the 
response performance of all of the plurality of candidate actions and either chooses the 
candidate action that contributes most to the expected regret estimate, or chooses a candidate 
action with probability proportional to its contribution to the expected regret estimate. 
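
For illustration only, one possible reading of the Monte Carlo selection in claim 23 can be sketched as follows. The sketch assumes that the uncertainty about each action's mean response is normal with a known standard error, and that an action's "contribution" to the expected regret estimate is the average shortfall of the current best observed action in the draws where that other action turns out to be best. Both are assumptions made for the example, and the function and parameter names are hypothetical.

```python
import random


def regret_contributions(means, std_errs, n_draws=10000, seed=0):
    """Estimate, per action, its contribution to the expected regret of
    continuing to perform the current best observed action."""
    rng = random.Random(seed)
    indices = range(len(means))
    best_obs = max(indices, key=lambda i: means[i])
    contrib = [0.0] * len(means)
    for _ in range(n_draws):
        # One plausible "true" state of all actions (normal model assumed).
        draw = [rng.gauss(m, s) for m, s in zip(means, std_errs)]
        winner = max(indices, key=lambda i: draw[i])
        if winner != best_obs:
            contrib[winner] += draw[winner] - draw[best_obs]
    return [c / n_draws for c in contrib]


def choose_action(contrib, fallback, rng=None):
    """Pick the largest contributor, or sample proportionally if rng is given.
    Falls back to `fallback` when no action contributes any regret."""
    if sum(contrib) <= 0:
        return fallback
    if rng is None:
        return max(range(len(contrib)), key=lambda i: contrib[i])
    return rng.choices(range(len(contrib)), weights=contrib, k=1)[0]


contrib = regret_contributions([1.0, 0.9, 0.2], [0.0, 0.5, 0.01])
next_action = choose_action(contrib, fallback=0)
```

Passing a `random.Random` instance as `rng` switches from the "contributes most" variant to the "probability proportional to its contribution" variant.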

24. (New) A method according to claim 18 further comprising the step of:- 

d) applying a temporal depreciation factor to the stored representations of the 
response performance in order to depreciate the significance of the stored representations 
over time. 

25. (New) A method according to claim 24 wherein step d) includes applying, for 
each candidate action, a different temporal depreciation factor to the stored representations of 
the response performance thereof. 
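
For illustration only, temporal depreciation of the stored representations might be sketched as below. The sketch assumes the stored representation is a weighted count and a weighted sum of responses, and that depreciation multiplies both by a factor between 0 and 1 at each time step; the class and attribute names are hypothetical.

```python
class DepreciatedStats:
    """Stored representation of one action's response performance,
    with older observations counting for less over time."""

    def __init__(self, factor):
        self.factor = factor  # temporal depreciation factor, 0 < factor <= 1
        self.weight = 0.0     # depreciated number of observations
        self.total = 0.0      # depreciated sum of responses

    def depreciate(self):
        # Apply one time step of depreciation to the stored values.
        self.weight *= self.factor
        self.total *= self.factor

    def record(self, response):
        self.weight += 1.0
        self.total += response

    def mean(self):
        return self.total / self.weight if self.weight > 0 else None


# A different factor per candidate action, as in claim 25.
stats = {"action_a": DepreciatedStats(0.5), "action_b": DepreciatedStats(0.9)}
stats["action_a"].record(1.0)
stats["action_a"].depreciate()
stats["action_a"].record(0.0)
```

After these calls the older response of 1.0 carries half the weight of the newer response of 0.0, so the depreciated mean for `action_a` is one third rather than one half.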

26. (New) A method according to claim 18 further comprising the step of:- 

e) forcing the performance of each candidate action a minimum number of times or at 
a minimum rate. 

27. (New) A method of controlling a system according to claim 18 wherein the 
system comprises a robot. 

28. (New) A method of controlling a system having two or more ranks of control 
arranged in a hierarchy, wherein each rank of control has a respective objective function and 
is capable of performing a plurality of candidate actions for that rank of control in the 
hierarchy, wherein the candidate action of a rank of control can represent a lower rank of 
control in the hierarchy, the method comprising controlling said rank of control according to 
the steps of the method of claim 18. 

29. (New) A method according to claim 28 wherein representations of said 
monitored response performance stored in step b) are shared with said rank of control. 

30. (New) A method according to claim 18 wherein the monitored response 
performance of a respective candidate action in step a) is stored in step b) in a form to enable 
sharing of the stored representation of said monitored response performance with another 
system. 

31. (New) A system controlled according to the method of claim 18. 

32. (New) A robot controlled according to the method of claim 18. 

33. (New) A control apparatus operating according to the method of claim 18 to 
control a system. 